250

17

Genomics

Fig. 17.1 The major parts of genomics and their interrelationships. The passage from sequence to

function can bypass structure via comparison with sequences of known structure

function, focusing on the sequence similarities, may be sufficient. The comparison

of sequences of gene coding for the same (functionally speaking) protein in different

species forms the basis for constructing molecular phylogenies, via their differences.

The huge collections of gene and protein data now available have encouraged the

so-called “hypothesis-free” or “minimalist” approach to sequence analysis. 1 This is

discussed in Sect. 17.6. Possibly the greatest value of this approach is not so much

in elucidating particular phenomena such as a function of a specific gene, but rather

in approaching an answer to the broader question of the meaning of the genome

sequence, without the distraction of imposed categories such as “gene”, which may

be, as is currently all too apparent, very difficult to define unambiguously.

17.1

DNA Sequencing

The raw data used for genomic analysis are DNA sequences. This and the next section

briefly describe the major experimental approaches involved. For investigating the

RNA in the cell—the RNome, which has taken on a renewed importance since the

discovery of the so-called “noncoding” RNA (i.e., not ultimately translated into

protein)—the RNA would normally first have to be converted into complementary

DNA (cDNA).

1 It is sometimes said of this approach, rather disparagingly perhaps, that “one can apparently make

significant discoveries about a biological phenomenon without insight or intuition”. Possibly this

criticism derives from J. S. Mill’s view that deduction cannot produce new knowledge. At any rate,

it belies the fact that in reality once some unsuspected structural feature in the sequence has been

discovered purely by manipulating the symbols; a great deal of insight and intuition is generally

applied to make sense of it.